Speaker Recognition Using Principal Component Analysis

نویسندگان

Peilv Ding

Liming Zhang

چکیده

This paper proposes a new feature vector— Mel Frequency Principal Coefficient(MFPC), applied to speaker recognition. It is derived by performing Principal Component Analysis on the Mel Scale Spectrum Vector. Compared with conventional Mel Frequency Cepstrum Coefficient, MFPC efficiently exploited the correlation information among different frequency channels. These correlations, which is mainly caused by the vocal tract resonance, have been found to vary consistently from one speaker to another. And we select these feature coefficients according to their Fisher Ratio, which will guarantee the largest discriminability between classes in the given dimensionality. Finally, we implement a textindependent speaker recognition system. It uses Vector Quantization to design codebooks of given reference speakers. The experiment results demonstrate that our proposed feature vector has characteristics of compactness, large discriminability and low redundancy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker recognition using supervised probabilistic principal component analysis

In this study, a supervised probabilistic principal component analysis (SPPCA) model is proposed in order to integrate the speaker label information into a factor analysis approach using the well-known probabilistic principal component analysis (PPCA) model under a support vector machine (SVM) framework. The latent factor from the proposed model is believed to be more discriminative than one fr...

متن کامل

Speaker recognition using MPEG-7 descriptors

Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers....

متن کامل

On the Use of Gaussian M Speaker Variabili

Analysis and modeling of speaker variability is important to help understand in-depth inter-speaker variances and to enhance current speech/speaker recognition system. In this paper we introduce adapted Gaussian mixture model (GMM) based speaker representation for the task. Two powerful multivariate statistical analysis methods, principal component analysis (PCA) and independent component analy...

متن کامل

Understanding Speaker Variability Using Correlation-based Principal Component Analysis Thesis Proposal

In this research, we study the relationship amongst speakers and diierent sounds in speech. We propose a new speaker normalization/adaptation model which incorporates correlations amongst phoneme classes, and explore the applications of the model. Using principal component analysis we construct a speaker space based on a speaker covariance matrix obtained from the training data. The speaker cov...

متن کامل

ACOUSTIC MODEL ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION AND ANIMAL VOCALIZATION CLASSIFICATION by

ACOUSTIC MODEL ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION AND ANIMAL VOCALIZATION CLASSIFICATION Jidong Tao, B.Eng., M.S. Marquette University, 2009 Automatic speech recognition (ASR) converts human speech to readable text. Acoustic model adaptation, also called speaker adaptation, is one of the most promising techniques in ASR for improving recognition accuracy. Adaptation works by tuning a g...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2001

Speaker Recognition Using Principal Component Analysis

نویسندگان

چکیده

منابع مشابه

Speaker recognition using supervised probabilistic principal component analysis

Speaker recognition using MPEG-7 descriptors

On the Use of Gaussian M Speaker Variabili

Understanding Speaker Variability Using Correlation-based Principal Component Analysis Thesis Proposal

ACOUSTIC MODEL ADAPTATION FOR AUTOMATIC SPEECH RECOGNITION AND ANIMAL VOCALIZATION CLASSIFICATION by

عنوان ژورنال:

اشتراک گذاری